# Multi-task generalization

Acemath RL Nemotron 7B GGUF
Other
AceMath-RL-Nemotron-7B is a mathematical reasoning model trained entirely through reinforcement learning. It is trained based on Deepseek-R1-Distilled-Qwen-7B and performs excellently in mathematical reasoning tasks. It also has certain generalization ability in coding tasks.
Large Language Model Transformers English
A
Mungert
633
1
Gemma 3 12B FornaxV.2 QAT CoT Q4 0 GGUF
This is an experimental small reasoning model designed to run on 8GiB consumer-grade GPUs with general inference capabilities. Through supervised fine-tuning (SFT) and high-quality reasoning trajectory training, the model can generalize its reasoning abilities to multiple tasks.
Large Language Model
G
ConicCat
98
1
Bamba 9B V2
Apache-2.0
Bamba-9B-v2 is a decoder-only language model built on the Mamba-2 architecture, focusing on text generation tasks and outperforming Llama 3.1 8B in performance.
Large Language Model Transformers
B
ibm-ai-platform
3,634
15
T0 3B
Apache-2.0
T0++ is a natural language processing model based on the T5 architecture, achieving zero-shot task generalization through multi-task prompt training, outperforming GPT-3 on various NLP tasks while being more compact.
Large Language Model Transformers English
T
bigscience
3,723
100
Llama 3 Gutenberg 8B
Other
A fine-tuned model based on Llama-3-8b, optimized using the Gutenberg DPO dataset, suitable for text generation tasks.
Large Language Model Transformers
L
nbeerbower
18
9
Roberta Large Zeroshot V2.0 C
MIT
A RoBERTa-large model designed for efficient zero-shot classification, trained on commercially friendly data, capable of performing text classification tasks without training data.
Text Classification Transformers English
R
MoritzLaurer
53
2
Wizardlm 13B V1.2
WizardLM-13B V1.2 is a large language model trained on Llama-2 13b, focusing on complex instruction-following capabilities.
Large Language Model Transformers
W
WizardLMTeam
989
226
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase